Template-driven Emotions Generation in Malay Text-to-Speech: A Preliminary Experiment
Authors
Abstract
This paper describes a pilot experiment on adding an affective component to the first Malay Text-to-Speech (TTS) system, Fasih. The aim is to test a new method of generating expressive speech via a template-driven system that uses diphones as the basic sound unit. The synthesized expressive speech can express four types of emotion; as an initial test, however, the pilot experiment focused on anger and sadness. The results show a recognition rate of over 60% for the synthesized speech of both emotions. The pilot experiment has paved the way for the development of an emotions filter to be embedded into Fasih, thus allowing for the possibility of generating an unrestricted Malay
Similar resources
Adding Emotions to Malay Synthesized Speech Using Diphone-based Templates
This paper concerns the addition of an affective component to Fasih, one of the first Malay Text-to-Speech systems developed by MIMOS Berhad. The goal is to introduce a new method of incorporating emotions into Fasih by building an emotions filter that is template-driven. The templates are diphone-based emotional templates that can portray four types of emotions, i.e. anger, sadness, happiness and...
eXTRA: A Culturally Enriched Malay Text to Speech System
This paper concerns the incorporation of naturalness into Malay Text-to-Speech (TTS) systems through the addition of a culturally-localized affective component. Previous studies on emotion theories were examined to draw up assumptions about emotions. These studies also include the findings from observations by anthropologists and researchers on culture-specific emotions, particularly, the Malay...
Integrating rule and template-based approaches for emotional Malay speech synthesis
The manipulation of prosody, including pitch, duration and intensity, is one of the leading approaches in synthesizing emotion. This paper reports work on the development of a Malay Emotional synthesizer capable of expressing four basic emotions, namely happiness, anger, sadness and fear for any form of text input with various intonation patterns using the prosody manipulation principle. The sy...
Template-driven generation of prosodic information for Chinese concatenative synthesis
In this paper, a template-driven generation of prosodic information is proposed for Chinese text-to-speech conversion. A set of monosyllable-based synthesis units is selected from a large continuous speech database. The speech database is employed to establish a word-prosody-based template tree according to the linguistic features: tone combination, word length, part-of-speech (POS) of the word...
Statistical Parametric Evaluation on New Corpus Design for Malay Speech Articulation Disorder Early Diagnosis
Corresponding Author: Tan Tian Swee, Medical Implant Technology Group (MediTEG), Cardiovascular Engineering Center, Material Manufacturing Research Alliance (MMRA), Faculty of Biosciences and Medical Engineering, Universiti Teknologi Malaysia, Malaysia. Abstract: Speech-to-Text, also known as speech recognition, plays an important role nowadays especially...
Publication date: 2005